Modelling the perception of simultaneous semi-vowels

Authors

  • Georg F. Meyer
  • William A. Ainsworth
Abstract

A model that predicts human performance in a simultaneous glide recognition task is described. The model combines a primitive, F0-guided segregation stage and a schema-driven stage with a heuristic that models whether listeners perceive one sound or two simultaneous sounds.

Introduction

Previous studies [1,2,3] suggest that human listeners use simple cues, such as signal harmonicity, speaker location, or segmental onset and offset, to aid in the segregation of simultaneous sounds. These cues are called ‘primitive’ grouping cues because they can be applied without prior knowledge. The only heuristic is that segments in an ‘auditory scene’ that share the same features are likely to have been produced by the same speaker. In addition to the primitive segregation process, human listeners use high-level knowledge, or schemata, to deal with mixtures of sounds [1].

One of the most intensively studied primitive grouping cues is harmonicity. Figure 1 shows human performance in a recognition task involving simultaneous vowels. Each panel shows the percentage of pairs that listeners recognised correctly. The stimuli were pairs of the French long vowels /#,G,K,Q,W,[/. One of the vowels always had a fundamental frequency (F0) of 100 Hz; the F0 of the second vowel is plotted along the x-axis. The only primitive segregation cue is the vowel fundamental frequency. The three panels show subject performance for vowels of 200 ms, 100 ms and 50 ms duration. For signals of at least 100 ms duration, subject performance improves significantly as the frequency difference between the vowels increases. If signals are only 50 ms long, no improvement in performance is seen. The perceptual data are surprising considering the dynamic nature of speech sounds, where stationary segments of more than 100 ms duration are very rare.
Another important feature that emerges from the data is that humans are able to recognise both constituents of a pair in around 65% of all cases, independent of the signal duration and without any grouping cues.

[Figure 1: Human Performance. Each panel plots % pairs correct (40 to 90) against 2nd vowel F0 (100, 106, 112, 126); one panel is labelled 204.8ms.]
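To make the stimulus conditions described above more concrete, the following minimal sketch expresses each second-vowel F0 as a musical interval relative to the 100 Hz reference vowel, and counts how many F0 periods fit into each stimulus duration. It is plain arithmetic on the stated stimulus parameters, not part of the authors' model. The F0 values 100, 106, 112 and 126 Hz are an assumption read from the x-axis ticks of Figure 1, and the function and constant names are hypothetical.

```python
import math

# Reference vowel F0 stated in the text.
REFERENCE_F0_HZ = 100.0
# Assumption: second-vowel F0 values as read from the Figure 1 x-axis ticks.
SECOND_VOWEL_F0_HZ = [100.0, 106.0, 112.0, 126.0]
# Stimulus durations stated in the text.
DURATIONS_MS = [200.0, 100.0, 50.0]


def semitone_difference(f0_hz, reference_hz=REFERENCE_F0_HZ):
    """Interval between f0_hz and reference_hz in equal-tempered semitones."""
    return 12.0 * math.log2(f0_hz / reference_hz)


def cycles_in_duration(f0_hz, duration_ms):
    """Number of F0 periods that fit into a stimulus of the given duration."""
    return f0_hz * duration_ms / 1000.0


if __name__ == "__main__":
    for f0 in SECOND_VOWEL_F0_HZ:
        print(f"{f0:.0f} Hz: {semitone_difference(f0):.2f} semitones above reference")
    for duration in DURATIONS_MS:
        cycles = cycles_in_duration(REFERENCE_F0_HZ, duration)
        print(f"{duration:.0f} ms holds {cycles:.0f} periods of the 100 Hz vowel")
```

Under these assumed values, the F0 steps correspond to roughly one, two and four semitones, which is a common way of spacing conditions in concurrent-vowel experiments.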

Similar resources

Problems of concurrent speech sound segregation in hearing-impaired children

Objective: This study was a basic investigation of the ability of concurrent speech segregation in hearing impaired children. Concurrent segregation is one of the fundamental components of auditory scene analysis and plays an important role in speech perception. In the present study, we compared auditory late responses or ALRs between hearing impaired and normal children. Materials & Methods...

The Study of Vowel Space and Formant Structure in Mazani Language

Objective: One of the parameters showing correct phonetic and phonological development is the correct and clear articulation of vowels, which is achieved by changing the shape of vocal cords through altering the height and position of the tongue and the movement of the lips and jaw. The tongue’s height and position are the basis of the production and difference of vowels. In other words, the raw s...

The perceptual segregation of simultaneous auditory signals: pulse train segregation and vowel segregation.

In the experiments reported here, we attempted to find out more about how the auditory system is able to separate two simultaneous harmonic sounds. Previous research (Halikia & Bregman, 1984a, 1984b; Scheffers, 1983a) had indicated that a difference in fundamental frequency (F0) between two simultaneous vowel sounds improves their separate identification. In the present experiments, we looked a...

Neural Representation of Concurrent Vowels in Macaque Primary Auditory Cortex

Successful speech perception in real-world environments requires that the auditory system segregate competing voices that overlap in frequency and time into separate streams. Vowels are major constituents of speech and are comprised of frequencies (harmonics) that are integer multiples of a common fundamental frequency (F0). The pitch and identity of a vowel are determined by its F0 and spectra...

Acoustic Analysis of Persian EFL Learners' Pronunciation of English Vowels

This paper reports the results of an experimental study on non-native production of English vowels. Two groups of Persian EFL learners varying in language proficiency were tested on their ability to produce the nine plain vowels of American English. Vowel production accuracy was assessed by means of acoustic measurements. Ladefoged and Maddieson’s (1996) F1 F2 measurements for American English v...

Publication date: 1997